Goto

Collaborating Authors

 residual block






78ed45281dd746a265fff16ff75a02e5-Paper-Conference.pdf

Neural Information Processing Systems

Unfortunately, these theoretical results cannot well explain the empirical successes of deep learning well, as they require the model size tobenolargerthan O(n)(thegeneralization boundsbecomevacuousotherwise).